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Abstract — Codes over permutations under the infinity norm 
liave been recently suggested as a coding scheme for correcting 
limited-magnitude errors in the rank modulation scheme. Given 
such a code, we show that a simple relabeling operation, which 
produces an isomorphic code, may drastically change the minimal 
distance of the code. Thus, we may choose a code structure for 
efficient encoding/decoding procedures, and then optimize the 
code's minimal distance via relabeling. 

We formally define the relabeling problem, and show that all 
codes may be relabeled to get a minimal distance at most 2. On 
the other hand, the decision problem of whether a code may be 
relabeled to distance 2 or more is shown to be NP-complete, and 
calculating the best achievable minimal distance after relabeling 
is proved hard to approximate. 

Finally, we consider general bounds on the relabeling problem. 
We specifically show the optimal relabeling distance of cyclic 
groups. A general probabilistic bound is given, and then used to 
show both the AGL(p) group and the dihedral group on p ele- 
ments, may be relabeled to a minimal distance of p — O(^plnp). 



I. INTRODUCTION 

FLASH memory is a prominent contender to address the 
increasing demand for dense storage devices. Initially, 
each flash-memory cell was able to store one bit of informa- 
tion. However, a multi-level technology is now common, in 
which each cell stores information by choosing one of q 2 
discrete levels. Hence, each cell can store log2 q bits. 

Flash memories possess inherent problems one has to ad- 
dress in designing such storage device. The problems range 
from data reliability to costly write operations. Recently, 
the rank-modulation scheme was proposed |fT3l in order to 
address specifically these inherent problems. In this scheme, 
the information is stored in the permutation induced by the 
n distinct charge levels being read from n cells. Each cell 
has a ranli which indicates its relative position when ordering 
the cells in descending charge level. The ranks of the n cells 
induce a permutation of {1,2, ... ,n}. 

While this new scheme alleviates some of the problems 
associated with current flash technology, the flash-memory 
channel remains noisy and error correction must be employed 
to increase reliability. In a recent work |l23l, spike-error 
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correction for rank modulation was addressed. Such errors are 
characterized by a limited-magnitude change in charge level 
in the cells, which readily translates into a hmited-magnitude 
change in the rank of, possibly, all cells in the stored permu- 
tation. These errors correspond to a bounded distance change 
in the induced permutation under the £oo-metric. We call 
codes protecting against such errors limited-magnitude rank- 
modulation codes, or LMRM-codes. Throughout the paper we 
will consider only LMRM-codes. 

A similar error model for flash memory was considered not 
in the context of rank modulation in 15], while a different 
error-model (charge-constrained errors for rank modulation) 
was studied in IH, lll4ll . lfT9l . Codes over permutations are 
also referred to as permutation arrays and have been studied 
in the past under different metrics lE], IS, ||6l, flT], Jg], ifTTI . 
1241 . Specifically, permutation arrays under the ^oo-metric 
were considered in ifTTl . We also mention a generahzation of 
the rank modulation scheme which uses partial permutations 
studied in ifTol. IITI. 

A code over permutations, being a subset of the symmetric 
group Sn, may happen to be a subgroup, in which case we 
call it a group code. Group theory offers a rich structure to be 
exploited when constructing and analyzing group codes, in an 
analogy to the case of linear codes over vector spaces. Hence, 
throughout this paper, we focus on LMRM group codes. 

If C and C are conjugate subgroups of the symmetric group, 
then from a group-theoretic point of view, they are almost 
the same algebraic object, and they share many properties. 
However, from a coding point of view these two codes can 
possess vastly different minimal distance, which is one of the 
most important properties of a code. For example, consider 
the following two subgroups of S„, C = {i, and C = 

{i, (1,2)}, where i is the identity permutation and the rest of 
the permutations are given in a cycle notation. The subgroups 
C and C are conjugate but the minimal distance of C and C is 
n — 1 and 1 respectively, which are the highest and the lowest 
possible minimal distances in the i'oo-metric. 

Hence, we conclude that the minimal distance of a code C 
depends crucially on the specific conjugate subgroup. Thus, 
while a certain group code might be chosen due to its group- 
theoretic structure (perhaps allowing simple encoding or de- 
coding), we may choose to use an isomorphic conjugate of the 
group, having the same group-theoretic structure, but with a 
higher minimal distance. We refer to the problem of finding the 
optimal minimal distance among all conjugate groups (sets) of 
a certain group (set) as the labeling problem. 

Apart from introducing and motivating the labeling problem, 
we show that this algorithmic problem is hard. However, we 
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are able to show the existence of a labeUng with high minimal 
distance for a variety of codes, based on the size of the code 
and the number of cycles in certain permutations derived from 
the code itself. 

The rest of the paper is organized as follows. In Section 
mi we define the notation, introduce the error model with 
the associated ^co-metric, as well as formally defining the 
labeling problem. We proceed in Section [III] to introduce two 
algorithmic problems related to the labeling problem, and we 
show their hardness. In Section |IV] we give some labeling 
results on ordinary groups and we present our main result of 
the paper, which gives general labeling results for arbitrary 
codes based on a probabilistic argument. In addition with give 
a few corollaries by applying this result to some well-known 
groups. We conclude in Section |V] with a summary of the 
results and short concluding remarks. 

II. Definitions and Notations 

For any m, n e N, m ^ n, let [m, n] denote the set 
{m,m + 1, . . . ,n}, where we also denote by [n] the set [1, n] . 
Given any n G N we denote by S„ the set of all permutations 
over the set [n] . 

We will mostly use the cycle notation for permutations 
f&Sn, where / = {fo, fi, ■ ■ ■ , fk-l) denotes the permuta- 
tion mapping /, ^ /(,+i)mod)c for ^'^ [0,fc- 1]. We shall 
occasionally use the vector notation whereby a permutation 
/ — [/i,/2, ■•■,/«] G S„ denotes the mapping i i— > for all 
i e [n\. Given two permutations f,gE S„, the product fg is a 
permutation mapping i i— > f{g{i)) for all i £ [n]. 

A code, C is a subset C C S,,. Note that sometimes C will 
also be a subgroup of Sn, in which case we shall refer to C 
as a group code. For a code C and a permutation f E Sn we 
call the code fCf~^ = {fcf^^ : c G C} a conjugate of C. 

Consider n flash memory cells which we name 1,2, ...,n. 
The charge level of each cell is denoted by c, G R for all 
f G [n]. In the rank-modulation scheme defined in |fT3ll . the 
information is stored by the permutation induced by the cells' 
charge levels in the following way: The induced permutation 
(in vector notation) is [/i,/2, ■ • ■ ,/«] iff Cf. > Cf.^^ for all 
ie[n-l]. 

Having stored a permutation in n flash cells, a corrupted 
version of it may be read due to any of a variety of error 
sources (see |]4]). To model a measure of the corruption in 
the stored permutations one can use any of the well-known 
metrics over S„ (see jSl). Given a metric over S„, defined 
by a distance function d : S„ x — t- N U {0}, an error- 
correcting code is a subset of S„ with lower-bounded distance 
between distinct members. 

In lfT4l . the Kendall-T metric was used, where the distance 
between two permutations is the number of adjacent transpo- 
sitions required to transform one into the other This metric is 
used when we can bound the total difference in charge levels. 

In this work we consider a different type of error - a limited- 
magnitude spike error Suppose a permutation / G S„ was 
stored by setting the charge levels of n flash memory cells to 
Ci, €2, ■ ■ ■ ,€„. We say a single spike error of limited-magnitude 
L has occurred in the z-th cell if the corrupted charge level, c'-. 



obeys |cj — cj| ^ L. In general, we say spike errors of limited- 
magnitude L have occurred if the corrupted charge levels of 
all the cells, c[, c'^,---, c',, obey 



max C; 



^ L. 



Denote by /' the permutation induced by the cell charge 
levels Cj, Cj, . . . , under the rank-modulation scheme. Under 
the plausible assumption that distinct charge levels are not 
arbitrarily close (due to resolution constraints and quantization 
at the reading mechanism), i.e., |c; — Cy| I for some positive 
constant ^ G IR for all i 7^ a spike error of limited-magnitude 
L implies a constant G N such that 



max 

16 [nl 



< d. 



Loosely speaking, an error of limited magnitude cannot change 
the rank of the cell i (which is simply f^^{i)) by d or more 
positions. 

We therefore find it suitable to use the i!oo-metric over Sn 
defined by the distance function 



^if'g) = max|/(z) 
I e n] 



for an f,ge Sn- Since this will be the distance measure used 
throughout the paper, we will usually omit the 00 subscript. 

Definition 1 . A limited-magnitude rank-modulation code 
(LMRM-code) with parameters {n,M,d), is a subset C C Sn 
of cardinality M, such that dco{f,g) ^ d for all f,gEC, 
f ¥^ g- '^ii^ sometimes omit the parameter M.) 

We note that unlike the charge-constrained rank-modulation 
codes of 111 4,1 . in which the codeword is stored in the per- 
mutation induced by the charge levels of the cells, here the 
codeword is stored in the inverse of the permutation. 

Permutation codes under the £co -metric have been studied 
before in lfT6l . ||231 . The size of spheres in this metric has 
been studied in ifTSl . Il20l . and the size of optimal anticodes 
in l22l . 

For a code C we define its minimal distance and denote it 
by d{C) as 

d{C)= mm d{f,g). 

A labeling function is a permutation I E Sn- A relabeling of 
a code C by a labeling Z G S„ is defined as the set ICI~^- We 
say that the code C has minimal distance d with a labeling 
function I when 

d{lCl-^) = d. 

It is wefl known (see ID) that the ^co-metric over S„ is only 
right invariant and not left invariant, i.e., for any f,g,hE Sn, 
d{f,g) = d{fh,gh), and usually d{f,g) 7^ d{hf,hg), thus 
we would expect that in many cases d{C) 7^ d{lCl^^)- 
Therefore, the questions of which labeling permutation leads to 
the optimal minimal distance, and what is the optimal minimal 
distance, rise naturally in the context of error-correcting codes 
over permutations under the infinity metric. Note that I is 
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called a labeling function because for a permutation in cycle 
notation / = (flj, . . . . . . . . ,fl„) we get 

lfr' = {l{a^),...,l{a,J)...{l{ak.+r),...,lM). 

The labeled permutation Ifl^^ has the same cycle structure as 
/ but the elements within each cycle are relabeled by I. 

By virtue of the right invariance of the £oo-metric, we shall 
assume throughout the paper that any code C C S,, contains 
the identity permutation, since right cosets of C preserve the 
distances between codewords, and one of the cosets contains 
the identity. Furthermore, 

d{C)= mm d{gh~\i), 

g,h e Cg^h 

where i is the identity element of S;,, and where the distance 
from the identity shall be called the weight of the permutation. 
This makes it easier to calculate the minimal distance of a 
group code since gh^^ simply goes over all the codewords. 

More specifically, we will explore the case where C is a 
subgroup of Sn and ask which conjugate group of C has the 
largest minimal distance. We denote by £jnin(C) (£max(C)) 
the minimal (maximal) achievable minimal distance among all 
the conjugates of a code C. 

III. The Labeling Problem is hard to approximate 

In this section we define two algorithmic problems regard- 
ing the labeling of codes, and show that they are hard to 
approximate. We shall begin by showing that for any code 
C, £min(C) ^ 2, which means that the minimal distance of 
a code depends crucially on its labeling. We then continue by 
showing the decision problem of whether £max(C) ^ 2 is NP- 
complete, while finding out £max(C) is hard to approximate. 

Recall the conjugacy relation over Sn- Two permutations 
g,fE Sn are said to be conjugate if there exists Sn such 
that hgh^^ = f. Conjugacy is an equivalence relation, and 
its equivalence classes are called conjugacy classes. Let T = 
{Ci,C2, . . -/C/t} be the set of conjugacy classes of S„. It is 
known that two permutations have the same cycle structure if 
and only if they share the same conjugacy class. Denote by 
B{i,r) the ball of radius r centered at the identity, 

B(/,r) = {feSn:d{f,i)^r}. 

The following lemma will help us show that any code C has 
a "bad" labeling, i.e., a labeling with minimal distance 1 or 2. 

Lemma 2. For any n G N there is a permutation f composed 
of a single n-cycle, i.e., f = {uq, a^, . . . , fl„_i ) G S„, such that 
Wi - fl(;+i) mod n\ ^ 2 for all i e [0, « - 1]. 

Proof: The proof is by induction. For n = 1,2,3 all n- 
cycles in Sn satisfy the claim. We assume the claim holds 
for n, and prove it also holds for n + 1. By the induction 
hypothesis there is / = {aQ,ai,. . .,fl„_i) G S„ that satisfies 
the claim. W.l.o.g., we can assume that = n — 1, 

flg = n, and ai = n — 2, otherwise /^^ would satisfy 
these conditions. Set fl„ = n + 1 and the permutation /' = 
(flO/^l/- • -/fl/i-i/fln) G S„+i satisfies the claim. ■ 



Corollary 3. Let C be any conjugacy class ofSn, then 

B(/,2)nC 7^0. 

Proof: Every conjugacy class of S„ is uniquely defined 
by the set of its cycles' lengths. Let {ni, ^2 . . . , n^t} be the 
cycles' lengths of the permutations in C, where I^f^j = n. 
By Lemma |2] we conclude that there exists some / G Cj such 
that 

/ = {a\, a\,..., a\ ) {a\, aj, . . . ,al^) . . . {a\, a\,..., a^^ ), 

where for each i, the set {a'jjjL-^ = [1 + E^ii "m/Em=i "m] 
and the cycle {a\,a2,...,al^.) satisfies Lemma |2] One can 
easily check that d{f,i) ^ 2, thus / E B(/,2). ■ 
Now we are ready to prove that any code C has a "bad" 
labeling. 

Theorem 4. For any code C C S„, |C| ^ 2, there exists a 
labeling of the elements such that the minimum distance is 
at most 2, i.e., there exists I G Sn such that d{lCl^^) ^ 2. 
Moreover, C has a labeling with minimal distance 1 if and 
only if the set {ab~^ : a,b(zC} contains an involution (a 
permutation of order 2 ). 

Proof: Let / G C, / 7^ be a permutation whose cycles' 
lengths are {ni, n2 . . . , tij^} and where 

/ = {a\, a\,..., a\ ) {a\, aj, . . . ,al^) . . . {4, a\,..., a\). 

By Corollary |3] there exists / gB(/, 2) with the same cycle 
structure as /. Let I E Sn be the permutation that conjugates 
/ to / , i.e., Ifl^^ = f . Therefore, 

d{lCr^) <: d{hl-^,]fl-^) = d{i,f') ^ 2. 

We note that the only permutations of weight 1 are invo- 
lutions in Sn, and that any involution in S„ may be easily 
relabeled to be of weight 1. Hence, C has a labeling with 
minimal distance 1 if and only if the set {ab^^ : a,bEC} 
contains an involution. ■ 

After proving that the worst labeling satisfies £inin(C) ^ 2 
for all C C Sn, we turn to consider the best labeling. We show 
that the algorithmic decision problem of determining whether 
a certain code C has £max(C) = 1 or £max(C) ^ 2 is NP- 
complete. 

2-DISTANCE PROBLEM: 

• INPUT: A subset of permutations C C Sn given as a list 
of permutations, each given in vector notation. 

• OUTPUT: The correct Yes or No answer to the question 
"Does C have a labeling that leads to a minimal distance 
at least 2, i.e., is £max(C) ^ 2? ". 

We start with a few definitions. For a code C C Sn, define 
its associated set of involutions as 

I{C) = {g E Sn : g^ = i, g = ab-^^i, a,bEC}. 

For any gEl{C) we define a set of edges, E{g), in the 
complete graph on n vertices, K„, where the vertices are 
conveniently called 1, 2, . . . , n, as 

E(^) = {uvEE{Kn) : g{u) = v,u^v}. 
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Recall that a Hamiltonian path in an undirected graph G is 
a path which visits each vertex exactly once. The following 
theorem shows an equivalence between the property of a code 
having a labeling with minimal distance at least 2 and the 
existence of a certain Hamiltonian path in the complete graph 
K„. 

Theorems. Let C C S„ be a code, then £max(C) ^ 2 if and 
only if there exists a Hamiltonian path in K„ which does not 
include all the edges E{g), for any gE I{C). 

Proof: Recall that d{C) = min^ /, gcy-^/; i) and 

note that any permutation which contains a cycle of length 
3 or more is at distance at least 2 from the identity. Hence, 
we only have to make sure the set of involutions, I{C), has 
distance at least 2 from the identity. 

If such a Hamiltonian path, a^, a2, ■ ■ ■ , cin, exists in Kn, then 
use this path as the labeling permutation and label the element 
flj as i, i.e., the labeling permutation / G S„ satisfies /(a,) = i 
for all f G [n]. For any ^ G / (C) we know that there exists some 
uv E E{g) which does not belong to the Hamiltonian path in 
K„, and therefore \l{u) — l{v)\ ^ 2. From the definition of 
E{g) we get that ^(m) = v, and so d{lgl^^,i) ^ 2. 

For the other direction, let / G Sn be a labeling such that 



d{lCl 



^ 2. We now consider the Hamiltonian path 



Z-l(l),/"^(2),...,/-l(n) in Kn. By our choice of /, for 
any gG I{C) there exists u,v G [n] such that g{u) = v and 
\l{u) — l{v) \ ^ 2. Hence, the edge uv does not belong to the 
constructed Hamiltonian path in X„. ■ 
By the last theorem we conclude that any algorithm that 
finds a labeling of C with minimal distance at least 2, actually 
finds a Hamiltonian path in K„ which does not include all the 
edges E{g), for any g E I{C). We are now able to show that 
the 2-DISTANCE problem is NP-complete. 

Theorem 6. The 2-DISTANCE problem is NP-complete. 

Proof: First, we show that 2-DlSTANCE is in NR For 
any given verifier, / G Sn, which is a labeling function, we 
compute the distance between ( and all the elements of 1(C). 
Note that |I(C)| ^ |Cp and constructing J(C) may be easily 
done in polynomial time. Thus, the question can be verified 
in polynomial time. 

In order to verify the completeness we shall reduce the 
HAMILTONIAN-R\TH problem (see Qll) to our problem. 
Let G{V,E) be a graph on n vertices (given as an n x « 
adjacency matrix) in which we want to decide whether a 
Hamiltonian path exists. Define the code 

C = {{u,v) : uv i E}U {/} C S,„ 

where (m, v) is the permutation that fixes everything in place 
except commuting the elements u and v. Obviously, we can 
construct C from G in polynomial time. We then run the 2- 
DISTANCE algorithm on C and return its answer 
We observe that 

1(C) = {{u,v){k,l) : {u,v),{k,l)EC,{u,v}n{k,l} =0} 
UC\{/}. 

If fli,fl2/ ■•-,'?« is a Hamiltonian path in G, then it is also 
a Hamiltonian path in Kn not containing all of E{g), for any 



gEl{C). This is true because E{g) only contains edges that 
are not in E. 

For the other direction, if there is a Hamiltonian path in Kn 
which does not include all the edges of E{g) for any g E I{C), 
then, in particular, this path does not include all of E{g), g eC, 
g I. Since for any such g = {u,v) gC, E{g) = {uv}, and 
uv i E, this path is also a Hamiltonian path in G. ■ 

We now define a harder algorithmic question and deduce 
by Theorem |6] that this problem is hard to approximate. 

OPTIMAL-DISTANCE PROBLEM: 

• INPUT: A subset of permutations C C S„ given in vector 
notation. 

. OUTPUT: The integer £max(C). 

For a constant e > 1 we say the problem may be e- 
approximated if there exists an efficient algorithm that for any 
input C computes /(C) which satisfies 

^/:max(C) ^ /(C) ^ e£max(C). 

Corollary?. For any constant 1 < e < 2, the OPTIMAL- 
DISTANCE problem cannot be e-approximated unless P = 
NP. 

Proof: Assume there exists an efficient algorithm com- 
puting /(C) G N which is an e-approximation of £max(C). If 
£max(C) = 1 then /(C) < 2 and so /(C) ^ 1. If, however, 
£max(C) ^ 2, then /(C) > 1. Thus, given such an efficient 
algorithm exists, we can decide whether £max(C) ^ 2, i.e., 
efficiently solve the 2-DISTANCE problem. By Theorem|6]we 
know that the 2-DISTANCE problem is NP-complete, and so 
P = NP. m 

IV. Constructions and Bounds 

In the previous section we have shown that the 2- 
DISTANCE and OPTIMAL-DISTANCE problems are hard. 
We are therefore motivated to focus on solving and bounding 
the latter problem for specific families of codes, and in 
particular, codes that form a subgroup of the symmetric group 
S„. The rich structure offered by such codes makes them easier 
to analyze, in much the same way as linear codes in vector 
space. Furthermore, knowing good labelings for certain groups 
is of great interest since one can use them as building blocks 
when constructing larger codes (see for example the direct and 
semi-direct product constructions in ll23l ). 

A. Optimal Labeling for Cyclic Groups 

The most simple basic groups one can think of are cyclic 
groups. Recall that for a cyclic group G there is an element 
gE G such that G is generated by the powers of g, i.e., G = 
{g^ : k E N}. We also recall that a group G acting on [n] is 
said to be transitive if for every a,b E [n] there exists g EG 
such that g{a) = b. The following theorem gives an exact 
optimal labeling for transitive cyclic groups over the set [n] . 

Theorem 8. Let C C S,, be a transitive cyclic group over the 
set [n] , then the optimal minimal distance for C is 



£max(C) — n 



y/^n - 3 - 1 
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Proof: Let / = {a^, Uj, ■ ■ ■ , a„) E C he a generatoiQ of C, 
and let d be an achievable minimal distance, i.e., there is a 
labeling / such that d(/C/"^) = d. Denote C = ICl^'^, then 
/' = Ifl^^ = {l{ai),l{a2), . ■ . ,l{a„)) is a generator of C. 
Define 

B = {{x,y)e[n] X [n] : |x-y| ^ d}. 

From the minimal distance of C' we know that for any g G C, 
g ^ L, d{g, l) ^ d. Hence, there is at least one pair {x,y) G B 
such that g{x) = y. On the other hand, C is cyclic and 
transitive and so is C, so for any pair (x, y) GB there is 
exactly one g G C such that g{x) = y. It follows that 

|C'\{/}| =«-! ^ |B| = {n-d){n-d + l). 

Solving the inequality and remembering that d is an integer, 
we get 

a ^ n 



In order to show the upper bound is achievable, conveniently 
denote k = [(V4n 



3 — 1 ) /2 1 and define the sets 
A2= [k + l,n-k\, A3 = [n-/c + l,n]. 



We define the following labeling Z G S„, 

1) First set /(a,) = / for all i G Ai. 

2) Then set /(fl(„+i_/)(2;c-„+,)/2+l) = ' for all i G A3. 

3) Finally set Z(fly) = f for all f G A2, where / is chosen 
arbitrarily from the left-over indices. 

We will show that for any s G [n — 1], d{f^, /) ^ n — k. Note 
that it is enough to show the claim for s ^ \n/2] since if 
s > \n/2] then by the right invariant property d{f^,i) = 

d{i,D = 

Let s G [[n/2]], and note that 



! = 1 



V4n - 3 - 1 



V4m -3 + 1 



1 V4n -3-1 V4n -3 + 1 

2 2 2 

4« - 4 



n-1 



However, since YJi=i ^ is an integer we get that 



k 

i=l 



"n — 1" 
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Thus, let m G [k] be the smallest integer such that 

/=0 



Hence 



m{lk-m + 1) 



s + l^fc-m + 1. 



(1) 



From labeling rule |2] we get that 



am{2k-m+i) , , = n - m + 1, 
2 1"^ 



and from labeling rule [T] 



m(2k-m+l) ^ 
2 S+i 



milk — m ■ 



■s + 1 



and so 



dif,i)=m^x\f{i)-i\ 

1 e [n] 

m{2k — m ■ 



m{2k-m + 1) 
2 

"'P'^-'"+l)^s+l 



-s + 1 
(fc-m + 1) 



(2) 



'a single-cycle generator must exist since C is transitive. 



am{2k-m+i) - (k-m + l) 

2 ri 

= \n — m + 1 — [k — m + 1)\ 
= n — k, 

where (|2]l follows from ([T). ■ 
Since the labeling of indices in A2 is arbitrary, we actually 
have [n — 2k) I different good labelings resulting from the 
theorem. 

Example 9. Applying Theorem [8] for the case n = 10 we get 
that k = 3, and the optimal minimal distance is £max(C) = 
n — k = 10 — 3 = 7. Moreover, such a labeling is a\ = 1, 
fl2 = 2, fl3 = 3, fl4 = 10, flg = 9, 07 = 8, and one of the cycles 
that generates the cyclic group of minimal distance 7 is 

(1,2,3,10,4,9,8,5,6,7). 



B. The Neighboring-Sets Method 

In this section we present a general method we call the 
neighboring-sets method. With this method, lower and up- 
per bounds on >Cmax(C) may be obtained provided certain 
neighboring sets of indices exist. We shall first describe the 
general method, and then apply it, using further probabilistic 
arguments, to show strong bounds on £max(AGL(p)) where 
AGL(p) is the affine general linear group of order p, as well 
as £max(D„), where D„ is the dihedral group of order n. 

We start by recalling the definitions of D„ and AGL(p) 
and dispensing with small parameters, for which we can give 
exact bounds. 

Definition 10. For n G N, the dihedral group of order n, de- 
noted D„ is the group generated by the two permutations 

Dn= {{1,2,..., n), {l,n)i2,n -!)...{ [n/2\ ,\n/2'])). 

We refer to the labeling of D„ described in the definition 
above as the natural labeling of D„. 

Definition 11. Let p G IN be a prime, then AGL(p) is defined 
by the subgroup of permutations that acts on the set [0, p — 1] 
and is generated by the permutations f {x) = x + 1 andg{x) = 
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ax, where all calculations are over GF(p) and a is a primitive 
elemen t in GF ( p ) . 

Throughout we shall consider only AGL(p) for p ^ 3. 
Like before, we refer to the natural labeling of AGL(p) as 
the labeling derived from the permutations / and g described 
above. For example, the natural labeling of AGL(5) is the 
group generated by the permutations (in cycle notation) / = 
(0, 1, 2, 3, 4) and g = (1, 2, 4, 3) . The following theorem gives 
us the minimal distance of the natural labeling of AGL(p). 

Theorem 12. For any prime p ^ 3, AGL(p) with the natural 
labeling has minimal distance (p — l)/2. 

Proof: Because AGL(p) is a group and the metric is 
right invariant it suffices to check only the distances from the 
identity permutation. Let c;, be the permutation d}, : x ^ x + h 
for some h E [l,p-l]. If b ^ {p - l)/2 then \a^{0) -0| ^ 
(p-l)/2. Otherwise, \(Tj,{p - 1) - {p - 1)\ ^ (p-l)/2. 
Thus, in any case, d{ai„i) ^ (p — l)/2. 

Let T G AGL(p) be an arbitrary permutation of the kind 
t(x) = ax + b where a ^ 1. Both of the permutations 
'^(p-l)/2 represent lines in the affine plane with dif- 

ferent slopes, and so there exists xq G [0, p — 1] such that 
t(xo) = cr(p_i)/2(^o)- Hence, \t{xo) - Xo\ ^ (p - l)/2 
and then d{T,i) ^ (p — l)/2, which concludes the proof. ■ 

The next theorem shows that the natural labeling is optimal 
for any prime p < 8. 

Theorem 13. For any prime 3 ^ p < 8, 

rmax(AGL(p)) = 

Proof: Let I be the set of involutions of AGL(p). It is 
easy to verify that any permutation g G J is of the form g{x) = 
—x + b for some b G GF(p), and so | /| = p. We note also that 
for any Xi,X2 G GF(p) there is exactly one involution ^G / 
such that g{xi) = X2 (finding g is by solving the equation 

X2 = —X\ + V). 

Assume that we have a labeling of AGL(p) with minimal 
distance more than the natural minimal distance. In particular, 
with this labeling every involution has minimal distance at 
least (p + l)/2 from the identity permutation. Let 



B = < : G GF(p), |x — yl 



p + 1 



Now, for any gEl there is at least one unordered pair 
{x,y} G B such that g{x) = y. It follows that 



IBI 



^ III 



Solving the inequality we get p^4 + vl7>8. ■ 
We can get a very similar result (which we omit) regarding 
the distance of the natural labeling of the dihedral group D„, 
showing it to be approximately n/2. 

It is tempting to assume that for large p and n we can 
get labelings for AGL(p) and D„ with normalized distance 
tending to 1, by virtue of their size alone: |D„| = 2n and 
|AGL(p)| = p(p — 1), both vanishing in comparison to the 



size of S„ and Sp, respectively. However, a simple example 
of a code 

C = {/}u{/(l,2)/-i:/GS„} CS„ 

dispels this thought since |C| = n{n - l)/2 + 1, d{C) = 1, 
and for any / G S,i we have ICl^^ = C, so relabeling does 
not change the code's distance. Thus, we turn to describe the 
neighboring-sets method which will attain better results for 
AGL(p) and D„. 

Definition 14. Let C C S,, be any set of permutations acting on 
[n\. Two disjoint subsets A, B C [n] are called C-neighboring 
sets if for any f eC, f i, the following holds 

(/(A)nB)u(/(B)nA) 7^0. 

We define 0{C) to be the smallest integer 0{C) = \A\ + |B|, 
where A and B are C-neighboring sets. If there are no such sets 
then we define 0{C) = oo. 

First we show that if C is a group then, 0(C) is closely 
related to its optimal minimal distance. 

Theorem 15. Let C C S„ be a group that acts on [n] with 
0(C) < 00, then 

n-0(C) + l ^£max(C). 



Moreover, if £max(C) ^ j then also 



<{C) ^ n 



0(C) 



Proof: Since 0(C) < oo there exist C-neighboring sets 
A,B C [n] such that |A| + |B| = 0(C). Let the labeling 
function / G S„ be such that /(A) = [1, |A|], and /(B) = 
[n — |B| + 1, n]. It is trivial to check that ICI~^ has minimal 
distance n - 0(C) + 1 d{C). 

For the other inequality, assume that the labeling I of C gives 
the optimal minimal distance, d[lCl^^) = >Cmax(C) ^ |. It 

follows that n — >Cmax(C) < Cmax{C) + 1, SO A = [1, M — 

'Cmax(C)], and B = [>Cmax(C) + are two disjoint sets. 
We will show that A and B are C-neighboring sets. 

For any n — >Cmax(C) < i < ^max{C) + 1, if such 
i exists at all, and for any f ElCl^^, f 7^ i, we have 
|/(i) - z| < £max{C). However, d{f,L) ^ £max(C) and so 
necessarily (/(A) n B) U (/(B) n A) 7^ 0. Thus, A and B 
are C-neighboring sets. Hence, 0(C) ^ 2{n — >Cmax(C)), and 
the result follows. ■ 

It is pointed out in the definition that some groups C C S„ 
might have 0(C) = 00, e.g., 0(S„) = 00. The following 
theorem shows that for any prime p > 5, 0(AGL(p)) is 
finite while also showing a lower bound. 



Theorem 16. If p 

prime p ^ 7, 



0(AGL(p)) ^ max|y'2(p 

For primes p ^ 37 we also have 

0(AGL(p)) p. 



3,5, then 0(AGL(p)) = 00. For any 



1),6 
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Proof: We first start with the lower bounds. It is 
well known that AGL(p) is 2-transitive, i.e., for any 
(fl, fo), (c,rf) e [0, p — 1]^, a ^ b, c d, there exists 
/e AGL(p) such that /((fl,fc)) = (c,d). If 0(AGL(p)) ^5 
and A and B are AGL(p) -neighboring sets then, w.l.o.g., we 
can assume that \A\ ^2. Hence there exists fE AGL(p), 
/ 7^ i, such that f{A) = A which contradicts the fact that 
A and B are AGL(p) -neighboring sets. As a consequence we 
also get that 0(AGL(3)) = 0(AGL(5)) = oo. 

The second lower bound is based on a counting argument. 
AGL(p) contains a permutation / composed of one cycle 
of length p. For any i E [p — 1] there exists at least one 
{k,m) e{Ax B)U{B X A) such that f'{k) = m. On the 
other hand, for any {k,m) e{A x B)U {B x A) there exists 
only one i E[p — 1] such that /' (fc) = m. Thus, 



p - 1 ^ |(A X B) U (B X A)| = 2\A\ ■ \B\ 



(3) 



and the result follows because the minimum of 0(AGL(p)) = 
\A\ + \B\ given by © is i/2(p - 1). 

For the upper bound we will show that there are AGL(p)- 
neighboring sets A,B C [0, p — 1] of sizes (p — l)/2 and 
(p + 1) /2, respectively, and thus 0(AGL(p)) ^ p. We note 
that A and B of the appropriate sizes are neighboring sets 
if and only if /(A) 7^ A for all / 7^ /. We shall therefore 
try to bound the number of such "bad" subsets A. Assume 
AC [0, p - 1], |A| = and / E AGL(p), / 7^ l Then 
/(A) = A iff A is a union of cycles of /. We define a 
polynomial which is related to the cycle-index polynomial of 
/ as 

Zf{x) = Yi{i+xr'^fl 

i 

where is the number of cycles of / of length i. It 

follows that the number of "bad" sets A for / is the coef- 
ficient of x(P~^)/^ in Zjr{x). Summing over all permutations 
/ E AGL(p) except the identity permutation will upper bound 
the number of such "bad" sets in AGL(p). 

The group AGL(p) is a disjoint union (except for the 
identity) of p groups which are: the cyclic group of order p 
generated by (0, 1, . . . , p — 1), and p — 1 cyclic groups gen- 
erated by a permutation of the form (aq/ i^i, ■ ■ ■ 
Since, in a cyclic group of order I, for each i\t there are (p{i) 
elements of order i, where (p is Euler's totient function, we 
can define the polynomial "Z. p^Q-y^i^-p^^ (x) and readily verify that 

2agl(p)(^) = E ^f^^) = 

fe AGL(p),/^/ 

= (p-i)(i + xn+ E p<?(0(i + ^)(i + ^')'^- 

i\p-l 

i>l 

We shall now upper-bound the coefficient fl(p_i)/2 of 

^(p-l)/2 2, 



Ip-l 



-AGL(p)' 



2i\p-l 
i>l 



p-1 
i 

p-1 
2i 



— 3 — 



where the upper bound is derived by upper bounding (p{i) ^ p, 
upper bounding the central binomial coefficient using ifTSl . and 
taking at most p summands. 

On the other hand, the number of subsets of [0, p — 1] of 



size (p — l)/2 is exactly 



(p-l)/2. 



(p-l)/2' 



> 



). One can easily verify that 



for all primes p ^ 37. Thus, there are sets A such that /(A) 7^ 
A, as required. ■ 

Example 17. Let p = 7. By Theorem [16] we have the lower 
bound 0(AGL(7)) ^ 6, and indeed the sets A = {0,1,2}, 
B = {4,5,6} are AGL{7) -neighboring sets. Furthermore, 
by Theorem \T5\ we get that 7 - 0(AGL(7)) + 1 = 2 < 
'Cmax(AGL(7)). However, by Theorem [13] we know that 
rmax(AGL(7)) = 3. □ 

The following theorem is our main result of this section. It 
gives a generic labeling result for a code C over the set [n] 
based solely on the size of the code and the number of cycles 
in the set of permutations {gh^^ : g,hEC}. 

Theorem 18. Let C C S„ be a code. If there exist p,t E'R, 
< p < 2, and t > 0, such that 

e-4 +e-"P'/('-P) ^ e^(/)pV(i-p) < 1^ (4) 
f=gh-^ 

g,h€C,gy^h 

where c{f) is the number of cycles in the permutation f, then 
there exists a labeling I E Sn such that 

£max(C) ^ d{ia-^) + [2pn + t\ . 

Proof: We use a probabilistic argument to show such 
a labeling exists. We partition the set [n] into three disjoint 
sets. A, B, and C, according the probabilities P(f€ A) = p, 
P{i EB) = p, and P{i EC) = 1 — 2p, where elements are 
placed independently. 

Assume first that f E Sn is a single cycle, i.e., / = 
(flQ/ cii, ■ ■ ■ , ^k-l)- We define the events 

D,(/) = {«,■ E A and E B ot a, E B and fl/+i E A} , 

for each i E [0,k — l], and where the indices are taken modulo 
k. Where it is clear from context, we shall write D, for short. 
We also define the event D| to be that A and B are {/}- 
neighboring sets. 

We would like to evaluate the probability that A and B 
are not {/} -neighboring sets, i.e., the probability P{Df) = 
P{r\\zl'D'i). It is easy to calculate that 



P{D, 



1 - 2p2. 



Furthermore, for all z G [0, — 1] we denote 



p, = P(D,|Do,...,D,_i). 
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We find the following recursion, for all i E [0,k — 3]: 



i+i = P(D,+i|Do,...,D,) 
= P{ai+ieC\D^,...,D^) 

+ P{ai+i i C|D^,...,D;) 

■P(D^|D^,...,D^,fl,+i i C) 
= P(fl,+ieC|D;^,...,D;) 
+ P(fl,+i i C|D^,...,D;)- (1-p). 



In addition. 



P(fli+ieC|Do,...,DO 



P(fl,+^_qDo,...,D,-,i) 

P(D,|Do,...,D,_i) 
■P(D,|Do,...,D,_i,fl,+ieC) 
1 -2p 



It follows that for all e [0,A: - 3], 
po = l-2p2 



1 -2p 



It is easily seen that for all / G [0, fc — 2], p,- ^ 1 — p, and so 
for all ze [0,fc-3], 

P/+1 = 1 - p + p ^ 1 



1-p 



1-p 



Furthermore, since < p < j, 

po = 1 - 2p2 <C 1 - 
Combining the above, we get that 

P(D7) = P(nJto'D;) 



!=0 

*;-2 



2 \ ''^-1 



;=0 

^ g_(l:-l)p2/{l-p) 

since 1 — x ^ e^^ for all x e R. 

Let g G Sn be a general permutation, with cycles' lengths 
. . . , /jf, and Y^i^x li = n, then the probability that A and 
B are not {g^}-neighboring sets is, 

P(D^) ^ J^e-(',-i)pV(i^P) = g-{n-/c)pV(i-p)_ 

! = 1 

Let S = |A| + |B| = Xi + X2 H h X„, where X,- is 

the indicator random variable for the event a, G A U B. By the 



union bound 



U DfU{S;^E{S) + t} 



< 



I 



\g,hec,g^h 

f=gh-^ 

g,heC,gy^h 
S,f'eC,g^h 

< h 

where P(S ^ E(S) + t) was upper-bounded using Hoeffding's 
inequality. 

Therefore, with positive probability neither of these events 
occur, i.e., there is a labeling for C such that for any h,g eC, 
h ^ g, A and B are -neighboring sets and S = |A| + 

|B| ^ £(S) + f = 2pn + f, and the resuh follows. ■ 

Note that when C forms a subgroup of S;, then the summa- 
tion in equation (|4]i is done only over the elements of C \ {/}. 
Theorem[T8]easily gives us achievable-labeling results for any 
subgroup of Sn only by knowing the number of cycles in each 
of its elements. 

We say that a G [n] is a fixed point of a permutation f E Sn 
if f{a) = a. The minimal degree of a subgroup C C Sn 
is the minimum number of non-fixed points among the non- 
identity permutations in C. The following corollary connects 
the minimal degree of a group and an achievable distance by 
applying Theorem [TS] 

Corollary 19. Let C be a subgroup of S„ with minimal degree 
d, such that there exist t > 0, < p < j, satisfying 



e-— + \C\e 2(i-p) < 1, 

then C has a labeling 1 E Sn with 

d{lCr^) ^n + l- [lpn + t\ . 

Proof: If C has minimal degree d, then the number of 
cycles of any g G C, g 7^ /, is at most n — j and the claim 
follows by Theorem [18] ■ 
We now proceed to show strong bounds on £max(AGL(p)) 
and £max(D„). 

Theorem 20. For q, a large enough prime. 



q - 0{yjq\nq) ^ Cmayi{AGL{q)) ^ q ■ 



Proof: For the upper bound we simply note that a 
transitive cyclic group of order is a subgroup of AGL{q), 
and then use Theorem [8] For the lower bound we recall that 
AGL{q) is sharply 2-transitive, hence, its minimal degree is 
q-l.Ey Corollary [19] 



+\AGL{q)\e ^(T^ ^ e ■/ + q^e"^ 
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For t = ^/q]n{q + 1) and p 



_2f^ , {1-1)P^ 1 



41n(t,+l) 

^1:1 . we get 

^2 



< 1. 



We note that for q large enough, p < It follows that 
£max(AGL(ij)) ^ + 1 - L2<?p + fj 



- 0{^/q\r\q). 



Theorem 21. For the dihedral group, D„,n ^ 37, 



n - 0(Vnln«) ^ >Cmax(D„) ^ « 



V4m - 3 - 1 



Proof: For the upper bound, again we note that a transitive 
cyclic group of order n is a subgroup of D„ and then use 
Theorem [8] For the lower bound, we know that | D„ \ = 2n, 
and that D„ has minimal degree d ^ n — 2 (it is « — 2 for 
even n, and n — 1 for odd n). We use Corollary [19] with 



t 



and get 



nln(2M + 2) 



/ln(2M + 2) 



e " +\D„\e 2(i-p) ^ e " 



nil 



Ine 2 



In 



2n + 2 2n+2 
It is easy to verify that p < \ for all n ^ 37. Thus, 

>Cmax(D„) + [2pn + t\ 



< 1. 



^ n — 2«i 



/ln(2n + 2) _ Jnln{2n+2 
n/2 - 1 



n — 0{Vn Inn). 



V. Summary 

In this work we examined the relabeling of permutation 
codes under the infinity metric. While relabeling preserves 
the code structure, producing an isomorphic code, it may 
drastically reduce or increase the relabeled code's minimal 
distance. 

We formally defined the relabeling problem and showed 
that all codes may be relabeled to get a minimal distance of 
at most 2. Deciding whether one can relabel a given code to 
achieve minimal distance 2 or more was shown to be an NP- 
complete problem. In addition, calculating the best minimal 
distance achievable after relabeling was shown to be hard to 
approximate. 

We then turned to bounding the best achievable minimal 
distance after relabeling for certain groups, and in particu- 
lar, cyclic groups, dihedral groups, and affine general linear 
groups. For cyclic groups, an exact solution and relabeling was 



shown. For the other two families of groups, a probabilistic 
method was used to give a general bound which turned out to 
provide strong bounds on the relabeling distance. 

Finding out how the best achievable minimal distance after 
relabeling depends on certain group properties, and finding 
its exact value for other well-known groups, is still an open 
problem. 
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